Towards a Better Understanding of Random Forests through the Study of Strength and Correlation
نویسندگان
چکیده
In this paper we present a study on the Random Forest (RF) family of ensemble methods. From our point of view, a ”classical” RF induction process presents two main drawbacks : (i) the number of trees has to be a priori fixed (ii) trees are independently, thus arbitrarily, added to the ensemble due to the randomization principle. Hence, this kind of process offers no guarantee that all the trees will well cooperate into the same committee. In this work we thus propose to study the RF mechanisms that explain this cooperation by analysing, for particular subsets of trees called sub-forests, the link between accuracy and properties such as Strength and Correlation. We show that these properties, through the Correlation/Strengh2 ratio, should be taken into account to explain the sub-forest performance.
منابع مشابه
PREDICTING CLUSTER B PERSONALITY DISORDER ACCORDING TO FIVE FACTOR ALTERNATIVE MODELS ZUCKERMAN- KUHLMAN AND EGO STRENGTH
Abstract Background& Aims:Due to the wide range of personality disorders and as well as alternative model DSM-5 for personality disorders, this study aimed to cluster B personality disorder according to five factor alternative models Zuckerman- Kuhlman and ego strength. Method:The study population is included all students of University of MohegheghArdabili in 2015(N=14000). A descriptive...
متن کاملThe study of the relationship between quality of life and health literacy among students of Abadan Faculty of Medical Sciences
Introduction: Health literacy is an essential element in people's ability to contribute to health-related activities, health decisions, and the ability to prevent illness and lifestyle. Today, health literacy is a global issue because many unpleasant health outcomes result from inadequate health literacy. The purpose of this study was to investigate the relationship between quality of life and ...
متن کاملتحلیل رابطه بین محیط داخلی و موفقیت سازمانی در بیمارستان
Background: Today's Hospitals operate in an inconstant and competitive environment. To have a successful presence in this environment, there is a need to recognize their own strengths and weakness points which can design appropriate strategies towards. The purpose of this study was to assess the internal environment of a hospital based on Weisbord model and analyze its relation with organizatio...
متن کاملThe Relationship between Attitude toward Research and Research Self-efficacy in MSc Students of Medical Sciences
Introduction: Positive attitude towards research can increase the students’ interest in research. Research self-efficacy is an effective factor on attitude towards research. It seems that higher research self-efficacy can influence students’ interest in research. Therefore, this study was conducted to determine the relationship between attitude to research and research self-efficacy in MSc stud...
متن کاملI-34: NRY Haplotype Analysis: towards A Better Understanding of The Genetic Basis of Spermatogenic Failure
It has been established that the Y chromosome carries genes required for spermatogenesis and male fertility. For many decades worldwide screening for gene identification has been conducted in research laboratories. However, it has been a difficult process in identifying such genes (i.e. causative mutations) which could explain the phenotypic variation and could be potentially used as markers fo...
متن کامل